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SEQUENCE LISTING 



<110> Dunn-Coleman, Nigel 
Goedegebuur, Frits 
Ward, Michael 
Yao, Jian 

<120> BGL4 Beta-Glucosidase and Nucleic Acids 
Encoding the Same 

<130> GC696 

<140> US 10/027,000 
<141> 2001-12-18 

<160>^T) , 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 2976 
<212> DNA 

<213> Trichoderma reesei 



<400> 1 

ttatagtcgc 

ttcctgtgcc 

ctcctcttag 

gtgtgtccaa 

cgtcgccata 

ttcacccgac 

ccatcttgaa 

ggcacacaaa 

acggcgtaag 

cgtcgctcgg 

aagaggccat 

cccctctcgg 

gagctgcggc 

ttttgtgcaa 

ctctccgtga 

cgttcatgac 

ttgatgggat 

gcacatacag 

cacgcttccg 

tcattgacca 

tgacggagaa 

ggaaggttgg 

gcaagaagaa 

gaggctctgc 

agctcgagac 

gcgagcagtg 

cccctggtac 

acctggtgga 

cgtacaccgc 

aggcgtacgt 

ccttcttcgg 

cgtacaagtt 

tcgtccccgg 

aaatcgaaaa- 



tttgttaaat 
tgatgtctat 
tgcttcagtg 
ggtctaccct 
tagagcagca 
ccttgccttg 
gaagctcacc 
ggctctcccc 
agggaccaag 
ttccacattc 
cgctaagagt 
tggacgtggc 
tctcatccgc 
tgatcaggag 
aatctacgca 
ggcgtacaat 
gcttcgaaag 
taccacagaa 
aggagaaaca 
gagggctagg 
cggccccgag 
caacgagggc 
gaagacgctg 
cgcactcagg 
gccgccatcg 
cctcacgccc 
ccctaaccgc 
ctactaccac 
cgacgaggac 
agacgaccag 
ctccgccacc 
caagatcgag 
ccacggctcc 
-g-tccgt.cg.cc 



tggcctcgag 
ctgccgttgg 
acgctaggtt 
gcagtggttt 
tattcttcta 
tcatccagtc 
ctggccgaga 
aagcatggag 
ttcttcaatg 
aaccaaactc 
gcgcatgtga 
ttcgagtcga 
ggcattcaga 
gacaggcgca 
ctcccgttcc 
ggcatcaatg 
gaatggggtt 
gccgttgtgg 
ctcaagttca 
gaagttcttc 
acgactgtca 
atcgtgctgc 
attgtcggcc 
gcctactacg 
tacaccgtcg 
gacggcgctc 
cagcacattg 
cccaaggcgg 
tgcacctacg 
ctcgtcgtcg 
cgcgaggaga 
ttcggctccg 
ctccgcgtcg 
ctcgccaagg 



gtcgacccac 
cctcctcatc 
cggtcacttt 
tgaacccttg 
tctccaaaga 
cttccatcat 
aggtcgatct 
tcccctctct 
gcgtccctgc 
tgctcgaaga 
tcctcggccc 
ttggtgagga 
gcactggagt 
tgatggtgca 
agattgctgt 
gcgtgtcgtg 
gggatggcct 
caggcctcga 
acgtctccaa 
agttcgtcaa 
acaacacccc 
tgaagaacga 
ccaacgccaa 
cagtcactcc 
gcgcctacac 
cgggcatgcg 
acgagctctt 
cagacacgtg 
agctcggcct 
acaacgccac 
cgggccgcat 
cacccaccta 
gcggctgcaa 
-ag.ca.cg.acca- 



gcgtccggct 
ctcatctccc 
gtcccccctt 
atatcctgct 
tcccctcacc 
ggctgatatt 
gctggctggt 
ccgctttaca 
ggcctgcttc 
ggcaggtaag 
gactatcaac 
tccgttcctg 
gcaggctacg 
gagcatcgtc 
gcgagactcc 
cagcgagaac 
aatcatgagc 
cctcgagatg 
cggaaagccc 
gaagtgtgct 
cgaaacggca 
gaacaacgtt 
gcaggccaca 
ctttgacggc 
caccgttcct 
ctggagggtc 
cttcaccaag 
gtacgccgac 
cgtcgtctgc 
caagcaggtc 
caatctcgtc 
caccctcaag 
ggtcattgac 
ggtcatcatc 



gcttgtcccg 


60 


tgttgtctgt 


120 


cgttgctctg 


180 


tgagcatccg 


240 


gagagttcta 


300 


gatgttgagg 


360 


atcgacttct 


420 


gatggcccca 


480 


ccttgcggca 


540 


atgatgggca 


600 


atgcaacgct 


660 


gcgggcttgg 


720 


atcaagcact 


780 


acggagcggg 


840 


cagccgggtg 


900 


cctaaatatc 


960 


gactggtacg 


1020 


cccggacctc 


1080 


tttatccacg 


1140 


gcctccggag 


1200 


gctctcctcc 


1260 


ctgcccttga 


1320 


taccacggcg 


1380 


ctcagcaagc 


1440 


cccattctag 


1500 


ttcaacgagc 


1560 


acggacatgc 


1620 


atggagggca 


1680 


ggcacggcaa 


1740 


cccggcgatg 


1800 


aagggcaaca 


1860 


ggcgacacca 


1920 


gaccaggccg 


1980 


-tgcgcg g g c c _ 


_ .2.0.40. 



1 



ttaacgccga 
tggaccagct 
cgggcacccc 
acggcggcaa 
cgggcaagct 
tccgcaccga 
acgagtttgc 
ttgccttttc 
tgaagaacac 
aagcggccaa 
agcccggcga 
ttgatgagga 
acagcagcgc 
attggtggtc 
gcatatagcg 
aaaaaaaaaa 



ctgggagacc 
cattgccgac 
cgaggagatg 
cgagacgggc 
gtccctcagc 
ggccgggcgc 
cgacaaggac 
caatctctcc 
cggctccgtg 
gattaaccgc 
gacgaaggcg 
gcgggatcag 
agcgaaggat 
tggcgtg taa 
ccgatgaaatf 
aaaaaaaaaa 



gagggcgccg 
gtggccgccg 
ccctggctcg 
aactccattg 
ttccccaagc 
acgctgtacg 
gtcaatttcc 
gtgtctcaca 
cccggcgcac 
cccgtcaagg 
gtgacaatcg 
tggtgtgtcg 
ggggttgcgc 
agtcgtgcat 
gagaaccaac 
aaaaaaaaaa 



accgcgcgag 
cgaacccaaa 
acgccacgcc 
ccgacgtcgt 
gcctgcagga 
gcgaggacgt 
cctttggcca 
aggacggcaa 
aggtggccca 
agctcaaggg 
aggagcagga 
aaaagggtga 
tcaggggtaa 
catctttggc 
gatcattgtg 
aaaaaa - ■> 



catgaagctc 
caccgtcgtc 
cgccgtcatc 
ctttggcgac 
caaccccgcg 
ctacgtcggg 
cggcctgtcc 
gctgagcgtg 
gctctacgtc 
cttcgcaaag 
gaagtacgtc 
ctatgaggtt 
gtttacggtg 
agattgaatc 
taaaaaaaaa 



cccggcgtgc 
gtcatgcaga 
caggcctggt 
tacaacccct 
tttctcaact 
tacaggtact 
tacaccactt 
tccctctccg 
aagcccctcc 
gtcgaactgc 
gctgcgtatt 
atcgtgagcg 
ggagagacgt 
cagtcacttt 
aaaaaaaaaa 



2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
28 80 
2940 
2976 



<210> 2 
<211> 833 
<212> PRT 

<213> Trichoderma reesei 



<400> 2 




























Met 


Ala 


Asp 


Ile 


Asp 


Val 


Glu 


Ala 


He 


Leu 


Lvs 


Lys Leu 


Thr 


Leu 


Ala 


1 








5 










10 








15 




Glu 


Lys 


Val 


Asp 


Leu 


Leu 


Ala 


Gly 


He 


Asp 


Phe 


Trp His 


Thr 


Lys 


Ala 


















25 








30 






Leu 


Pro 


Lys 


His 


Gly 


Val 


Pro 


Ser 


Leu 


Arg 


Phe 


Thr Asp 


Gly 


Pro 


Asn 
















40 








45 








Gly 


Val 


Arg 


Gly 


Thr 


Lys 


Phe 


Phe 


Asn 


Glv 


Val 


Pro Ala 


Ala 


Cys 


Phe 




50 










55 










60 








Pro 


Cys 


Gly 


Thr 


Ser 


Leu 


Gly 


Ser 


Thr 


Phe 


Asn 


Gin Thr 


Leu 


Leu 


Glu 


65 








70 










75 








80 


Glu 


Ala 


Gly 


Lys 


Met 


Met 


Gly 


Lys 


Glu 


Ala 


He 


Ala Lys 


Ser 


Ala 


His 










85 










90 








95 




Val 


He 


Leu 


Gly 


Pro 


Thr 


He 


Asn 


Met 


Gin 


Arg 


Ser Pro 


Leu 


Gly 


Gly 








100 










105 








110 






Arg 


Gly 


Phe 


Glu 


Ser 


He 


Gly 


Glu Asp 


Pro 


Phe 


Leu Ala 


Gly 


Leu 


Gly 






115 










120 








125 








Ala 


Ala 


Ala 


Leu 


He 


Arg 


Gly 


lie 


Gin 


Ser 


Thr 


Gly Val 


Gin 


Ala 


Thr 




130 










135 










140 








He 


Lys 


His 


Phe 


Leu 


Cys 


Asn 


Asp 


Gin 


Glu 


Asp 


Arg Arg Met 


Met 


Val 


145 










150 










155 








160 


Gin 


Ser 


He 


Val 


Thr 


Glu 


Arg 


Ala 


Leu 


Arg 


Glu 


He Tyr Ala 


Leu 


Pro 










165 










170 








175 




Phe 


Gin 


He 


Ala 


Val 


Arg Asp 


Ser 


Gin 


Pro 


Gly 


Ala Phe 


Met 


Thr 


Ala 








180 










185 








190 






Tyr 


Asn 


Gly 


He 


Asn 


Gly Val 


Ser 


Cys 


Ser 


Glu 


Asn Pro 


Lys 


Tyr 


Leu 






195 










200 








205 








Asp 


Gly 


Met 


Leu 


Arg 


Lys 


Glu 


Trp 


Gly 


Trp 


Asp 


Gly Leu 


He 


Met 


Ser 




210 










215 










220 








Asp 


Trp 


Tyr 


Gly 


Thr 


Tyr 


Ser 


Thr 


Thr 


Glu 


Ala 


Val Val 


Ala 


Gly 


Leu 


225 










230 










235 








240 


Asp 


Leu 


Glu 


Met 


Pro 


Gly 


Pro 


Pro 


Arg 


Phe 


Arg 


Gly Glu 


Thr 


Leu 


Lys 










245 










250 








255 




Phe 


Asn 


Val 


Ser 


Asn 


Gly 


Lys 


Pro 


Phe 


He 


His 


Val He 


Asp 


Gin 


Arg 








260 










265 








270 






Ala 


Arg 


Glu 


Val 


Leu 


Gin 


Phe 


Val 


Lys 


Lys 


Cys 


Ala Ala 


Ser 


Gly Val 






275 










280 








285 








-Thr 


-Gl-u 


Asn- 


Gly 


Pro 


Glu 


-T-hr- 


Thr 


Val 


As n. As n 


-Thr P-ro- 


Glu- 


-Thr- Ala 



2 






290 










295 










300 










Ala 


Leu 


Leu 


Arq 


Lys 


Val 


Gly 


Asn 


Glu 


Gly 


He 


Val 


Leu 


Leu 


Lys 


Asn 


305 










310 










315 










320 


Glu 


Asn 


Asn 


Val 


Leu 


Pro 


Leu 


Ser 


Lvs 


Lvs 


Lvs 


Lvs 


Thr 


Leu 


He 


Val 










325 










330 










335 




Gly 


Pro 


Asn 


Ala 


Lys 


Gin 


Ala 


Thr 


Tyr 


His 


Gly 


Gly 


Gly 


Ser 


Ala 


Ala 








340 










345 










350 






Leu 


Arq 


Ala 


Tyr 


Tyr 


Ala 


Val 


Thr 


Pro 


Phe 


Asp 


Gly 


Leu 


Ser 


Lys 


Gin 






355 










360 










365 








Leu 


Glu 


Thr 


Pro 


Pro 


Ser 


Tyr 


Thr 


Val 


Gly 


Ala 


Tyr 


Thr 


Thr 


Val 


Pro 




370 










375 










380 










Pro 


He 


Leu 


Gly 


Glu 


Gin 


Cys 


Leu 


Thr 


Pro 


Asp 


Gly 


Ala 


Pro 


Gly Met 


385 










390 










395 










400 


Arg 


Trp 


Arg 


Val 


Phe 


Asn 


Glu 


Pro 


Pro 


Gly 


Thr 


Pro 


Asn 


Arg 


Gin 


His 










405 










410 










415 




He 


Asp 


Glu 


Leu 


Phe 


Phe 


Thr 


Lys 


Thr 


Asp 


Met 


His 


Leu 


Val 


Asp 


Tyr 








420 










425 










430 






Tvr 


His 


Pro 


Lvs 


Ala 


Ala 


Asp 


Thr 


Trp 


Tyr 


Ala 


Asp 


Met 


Glu 


Gly Thr 






435 










440 










445 








Tyr 


Thr 


Ala 


Asp 


Glu 


Asp 


Cys 


Thr 


Tyr 


Glu 


Leu 


Gly 


Leu 


Val 


Val 


Cys 




450 










455 










460 










Gly 


Thr 


Ala 


Lys 


Ala 


Tyr 


Val 


Asp 


Asp 


Gin 


Leu 


Val 


Val 


Asp 


Asn 


Ala 


465 










470 










475 










480 


Thr 


Lys 


Gin 


Val 


Pro 


Gly 


Asp 


Ala 


Phe 


Phe 


Gly 


Ser 


Ala 


Thr 


Arg 


Glu 










485 










490 










495 




Glu 


Thr 


Gly 


Arg 


He 


Asn 


Leu 


Val 


Lys 


Gly 


Asn 


Thr 


Tyr 


Lys 


Phe 


Lys 








500 










505 










510 






He 


Glu 


Phe 


Gly 


Ser 


Ala 


Pro 


Thr 


Tyr 


Thr 


Leu 


Lys 


Gly 


Asp 


Thr 


He 






515 










520 










525 








Val 


Pro 


Gly 


His 


Gly 


Ser 


Leu 


Arg 


Val 


Gly 


Gly 


Cys 


Lys 


Val 


He Asp 




530 










535 










540 










Asp 


Gin 


Ala 


Glu 


He 


Glu 


Lvs 


Ser 


Val 


Ala 


Leu 


Ala 


Lvs 


Glu 


His 


Asp 


545 










550 










555 










560 


Gin 


Val 


He 


He 


Cys 


Ala 


Gly 


Leu 


Asn 


Ala 


Asp 


Trp 


Glu 


Thr 


Glu 


Gly 










565 










570 










575 




Ala 


Asp 


Arg 


Ala 


Ser 


Met 


Lys 


Leu 


Pro 


Gly 


Val 


Leu 


Asp 


Gin 


Leu 


He 








580 










585 










590 






Ala 


Asp 


Val 


Ala 


Ala 


Ala 


Asn 


Pro 


Asn 


Thr 


Val 


Val 


Val 


Met 


Gin 


Thr 






595 










600 










605 








Gly 


Thr 


Pro 


Glu 


Glu 


Met 


Pro 


Trp 


Leu 


Asp 


Ala 


Thr 


Pro 


Ala 


Val 


He 




610 










615 










620 










Gin 


Ala 


Trp 


Tyr 


Gly 


Gly 


Asn 


Glu 


Thr 


Gly 


Asn 


Ser 


He 


Ala 


Asp 


Val 


625 










630 










635 










640 


Val 


Phe 


Gly 


Asp 


Tyr 


Asn 


Pro 


Ser 


Gly 


Lys 


Leu 


Ser 


Leu 


Ser 


Phe 


Pro 










645 










650 










655 




Lvs 


Arg 


Leu 


Gin 


Asp 


Asn 


Pro 


Ala 


Phe 


Leu 


Asn 


Phe 


Arg 


Thr 


Glu 


Ala 








660 










665 










670 






Gly 


Arg 


Thr 


Leu 


Tyr 


Gly 
-* 


Glu 


Asp 


Val 


Tyr 


Val 


Gly 


Tyr 


Arg 


Tyr Tyr 






675 










680 










685 








Glu 


Phe 


Ala 


Asp 


Lvs 


Asp 


Val 


Asn 


Phe 


Pro 


Phe 


Gly 


His 


Gly 


Leu 


Ser 




690 










695 










700 










Tyr 


Thr 


Thr 


Phe 


Ala 


Phe 


Ser 


Asn 


Leu 


Ser 


Val 


Ser 


His 


Lvs 


Asp 


Gly 


705 










710 










715 










720 


Lys 


Leu 


Ser 


Val 


Ser 


Leu 


Ser 


Val 


Lys 


Asn 


Thr 


Gly 


Ser 


Val 


Pro 


Gly 










725 










730 










735 




Ala 


Gin 


Val 


Ala 


Gin 


Leu 


Tyr 


Val 


Lys 


Pro 


Leu 


Gin 


Ala 


Ala 


Lys 


He 








740 










745 










750 






Asn 


Arg 


Pro 


Val 


Lys 


Glu 


Leu 


Lys 


Gly 


Phe 


Ala 


Lys 


Val 


Glu 


Leu 


Gin 






755 










760 










765 








Pro 


Gly 


Gl.u_ 


. Thr. 


.Lys. 


Ala. 


.Val 


Thr 


_I.l_e 


Glu. 


. Glu. 


_Gln_ 


_Gl.u 


_ Lys_ 


T_yr_ Val 



770 

Ala Ala Tyr Phe Asp 
785 

Asp Tyr Glu Val He 
805 

Ala Leu Arg Gly Lys 
820 

Val 



775 

Glu Glu Arg Asp Gin Trp 
790 ~ * 795 

Val Ser Asp Ser Ser Ala 
810 

Phe Thr Val Gly Glu Thr 
825 



780 

Cys Val Glu Lys Gly 
800 

Ala Lys Asp Gly Val 
815 

Tyr Trp Trp Ser Gly 
830 



<210> 3 
<211> 2502 
<212> DNA 

<213> Trichoclerma reesei 



<400> 3 

atggctgata ttgatgttga ggccatcttg aagaagctca ccctggccga gaaggtcgat 60 

ctgctggctg gtatcgactt ctggcacaca aaggctctcc ccaagcatgg agtcccctct 120 

ctccgcttta cagatggccc caacggcgta agagggacca agttcttcaa tggcgtccct 180 

gcggcctgct tcccttgcgg cacgtcgctc ggttccacat tcaaccaaac tctgctcgaa 240 

gaggcaggta agatgatggg caaagaggcc atcgctaaga gtgcgcatgt gatcctcggc 300 

ccgactatca acatgcaacg ctcccctctc ggtggacgtg gcttcgagtc gattggtgag 360 

gatccgttcc tggcgggctt gggagctgcg gctctcatcc gcggcattca gagcactgga 420 

gtgcaggcta cgatcaagca ctttttgtgc aatgatcagg aggacaggcg catgatggtg 480 

cagagcatcg tcacggagcg ggctctccgt gaaatctacg cactcccgtt ccagattgct 540 

gtgcgagact cccagccggg tgcgttcatg acggcgtaca atggcatcaa tggcgtgtcg 600 

tgcagcgaga accctaaata tcttgatggg atgcttcgaa aggaatgggg ttgggatggc 660 

ctaatcatga gcgactggta cggcacatac agtaccacag aagccgttgt ggcaggcctc 720 

gacctcgaga tgcccggacc tccacgcttc cgaggagaaa cactcaagtt caacgtctcc 780 

aacggaaagc cctttatcca cgtcattgac cagagggcta gggaagttct tcagttcgtc 840 

aagaagtgtg ctgcctccgg agtgacggag aacggccccg agacgactgt caacaacacc 900 

cccgaaacgg cagctctcct ccggaaggtt ggcaacgagg gcatcgtgct gctgaagaac 960 

gagaacaacg ttctgccctt gagcaagaag aagaagacgc tgattgtcgg ccccaacgcc 1020 

aagcaggcca cataccacgg cggaggctct gccgcactca gggcctacta cgcagtcact 1080 

ccctttgacg gcctcagcaa gcagctcgag acgccgccat cgtacaccgt cggcgcctac 1140 

accaccgttc ctcccattct aggcgagcag tgcctcacgc ccgacggcgc tccgggcatg 1200 

cgctggaggg tcttcaacga gccccctggt acccctaacc gccagcacat tgacgagctc 1260 

ttcttcacca agacggacat gcacctggtg gactactacc accccaaggc ggcagacacg 1320 

tggtacgccg acatggaggg cacgtacacc gccgacgagg actgcaccta cgagctcggc 1380 

ctcgtcgtct gcggcacggc aaaggcgtac gtagacgacc agctcgtcgt cgacaacgcc 1440 

accaagcagg tccccggcga tgccttcttc ggctccgcca cccgcgagga gacgggccgc 1500 

atcaatctcg tcaagggcaa cacgtacaag ttcaagatcg agttcggctc cgcacccacc 1560 

tacaccctca agggcgacac catcgtcccc ggccacggct ccctccgcgt cggcggctgc 1620 

aaggtcattg acgaccaggc cgaaatcgaa aagtccgtcg ccctcgccaa ggagcacgac 1680 

caggtcatca tctgcgcggg ccttaacgcc gactgggaga ccgagggcgc cgaccgcgcg 1740 

agcatgaagc tccccggcgt gctggaccag ctcattgccg acgtggccgc cgcgaaccca 1800 

aacaccgtcg tcgtcatgca gacgggcacc cccgaggaga tgccctggct cgacgccacg 1860 

cccgccgtca tccaggcctg gtacggcggc aacgagacgg gcaactccat tgccgacgtc 1920 

gtctttggcg actacaaccc ctcgggcaag ctgtccctca gcttccccaa gcgcctgcag 1980 

gacaaccccg cgtttctcaa cttccgcacc gaggccgggc gcacgctgta cggcgaggac 2040 

gtctacgtcg ggtacaggta ctacgagttt gccgacaagg acgtcaattt cccctttggc 2100 

cacggcctgt cctacaccac ttttgccttt tccaatctct ccgtgtctca caaggacggc 2160 

aagctgagcg tgtccctctc cgtgaagaac accggctccg tgcccggcgc acaggtggcc 2220 

cagctctacg tcaagcccct ccaagcggcc aagattaacc gccccgtcaa ggagctcaag 2280 

ggcttcgcaa aggtcgaact gcagcccggc gagacgaagg cggtgacaat cgaggagcag 2340 

gagaagtacg tcgctgcgta ttttgatgag gagcgggatc agtggtgtgt cgaaaagggt 2400 

gactatgagg ttatcgtgag cgacagcagc gcagcgaagg atggggttgc gctcaggggt 2460 

aagtttacgg tgggagagac gtattggtgg tctggcgtgt aa 2502 



4 



